* NOTICES *

JPO and NCIPI are not responsible for any
damages caused by the use of this translation.

1. This document has been translated by computer. So the translation may not reflect the original
precisely.

2. **** shows the word which can not be translated.
3. In the drawings, any words are not translated.



CLAIMS 



[Utility model registration claim] 

[Claim 1] A voice recognition unit which, when the signal of the voice uttered by a speaker is
input, outputs a talk back signal corresponding to this sound signal and answers with voice,
having at least: the voice input section (1); the used-language judging section (2) which judges
the language from the input sound signal; the speech recognition section (3) which distinguishes
the contents of the input sound signal; the control section (4) which controls a device according
to the speech recognition result; and the reply signal generating section (5) for an audio
response. The speech recognition section (3) can recognize two or more languages from the input
sound signal, and the reply signal generating section (5) can give an audio response in two or
more languages. The voice recognition unit is characterized by being constituted so that the
judgment signal output from the used-language judging section (2) is input into the speech
recognition section (3) to select the recognition object language, and the judgment signal output
from the used-language judging section (2) is input into the reply signal generating section (5)
to select the audio response language.

[Claim 2] A voice recognition unit which, when the signal of the voice uttered by a speaker is
input, outputs a talk back signal corresponding to this sound signal and answers with voice,
having at least: the voice input section (1); the speech recognition section (3) which
distinguishes the contents of the input sound signal; the control section (4) which controls a
device according to the speech recognition result; the reply signal generating section (5) for an
audio response; and the used-language setting section (6). The speech recognition section (3) can
recognize two or more languages from the input sound signal, and the reply signal generating
section (5) can give an audio response in two or more languages. The voice recognition unit is
characterized in that at least one of said speech recognition section (3) and said reply signal
generating section (5) is constituted so that the recognition object language or the audio
response language is chosen according to the selection signal output from the used-language
setting section (6).



DETAILED DESCRIPTION



[Detailed explanation of a design] [0001] 
[Industrial Application] 

This design relates to a voice recognition unit which answers with voice to the utterance of a
speaker. When the driver of an automobile or the like manually operates devices such as an air
conditioner, audio equipment, car navigation, a land mobile radiotelephone, or auto-cruise while
driving, safe driving is hindered and there is a possibility of causing an accident. To improve
operability, it is therefore desirable to direct the operation of these devices by voice, and to
have the device side answer by voice to the directed operation so that the speaker can check it.
[0002] 

[Description of the Prior Art] 

Voice recognition units, not only those for automobiles, recognize a speaker's voice and then
operate a device according to the recognized contents. The device side then emits a tone to the
speaker to answer with the result.
[0003] 

Drawing 5 is a flow chart which shows the response operation in a conventional voice
recognition unit. In the case of operating audio equipment, when the speaker first utters "play"
in step S1 to direct playback, the sound signal is input through a microphone in step S2, and
pretreatment is performed in step S3 so that the signal is suitable for speech recognition.
[0004] 

Subsequently, as shown in step S4, speech recognition is performed in the speech recognition
section, and the recognition result is judged in step S5. When the word "play" has been
recognized, the unit answers with an audible tone or the like in step S6 to tell the speaker that
the word has been recognized, and then playback is started in step S7.
[0005] 

On the other hand, when the speaker's words cannot be recognized, a tone which tells that is
output in step S8, and the speaker is urged to carry out voice input again.

[0006] 

[Problem(s) to be Solved by the Device]

However, a response by such an audible tone or the like can express only about two states:
speech recognition succeeded, or it did not. In the case of an automobile or the like, many
devices are installed and many switches must be operated, so the operations performed by switch
are also varied. Therefore, it is desirable for the response to answer concretely by voice,
according to the contents of the operating instruction, instead of by an audible tone.
[0007] 

For example, when a speaker utters "play", if the instruction is recognized and playback is
possible, the unit answers by voice, such as "playing a cassette tape"; when speech recognition
fails, it answers by voice, such as "talk once again". If such an audio response (talk back) can
be carried out, devices can be operated more accurately in an interactive manner, and operability
improves.
[0008] 

Moreover, if the speaker can be told by audio response about conditions such as temperature and
airflow in the case of an air conditioner, or the present speed in the case of auto-cruise,
operation becomes simpler.
[0009] 

Incidentally, although speech recognition is usually performed in Japanese, in the case of an
export vehicle it is necessary to make speech recognition possible in the language of the
destination. Moreover, when answering by voice as in this design, it is also necessary for an
export vehicle to answer in the language of the destination.
[0010]

However, to enable speech recognition and talk back in a destination language such as English
only for export vehicles, it is necessary to separately develop recognition equipment which can
recognize English speech and can also perform talk back in English, and there is the problem of
high cost.
[0011] 

On the other hand, as internationalization progresses, in the case of a rental car or the like,
the same car may be driven by a Japanese person or by a foreigner. In such a case, if speech
recognition and talk back cannot be carried out in both Japanese and English, the desired purpose
cannot be attained, which is inconvenient.
[0012] 

The technical problem of this design, paying attention to such problems, is to improve usability
by making speech recognition and audio response possible in two or more languages in advance,
and to realize inexpensive equipment of high versatility.
[0013] 

[Means for Solving the Problem] 

Drawing 1 is a block diagram explaining the basic configuration of the voice recognition unit by
this design. The voice recognition unit of claim 1 is a voice recognition unit which, when the
signal of the voice uttered by a speaker is input, outputs a talk back signal corresponding to
this sound signal and answers with voice.

As shown in drawing 1 (1), it has at least the voice input section 1, the used-language judging
section 2 which judges the language from the input sound signal, the speech recognition section 3
which distinguishes the contents of the input sound signal, the control section 4 which controls
a device according to the speech recognition result, and the reply signal generating section 5
for an audio response.
[0014] 

The speech recognition section 3 can recognize two or more languages from the input sound
signal, and the reply signal generating section 5 can give an audio response in two or more
languages. The judgment signal output from the used-language judging section 2 is input into the
speech recognition section 3 to select the recognition object language, and the judgment signal
output from the used-language judging section 2 is also input into the reply signal generating
section 5 to select the audio response language.
[0015] 

The voice recognition unit of claim 2 is likewise a voice recognition unit which, when the signal
of the voice uttered by a speaker is input, outputs a talk back signal corresponding to this
sound signal and answers with voice.

As shown in drawing 1 (2), it has at least the voice input section 1, the speech recognition
section 3 which distinguishes the contents of the input sound signal, the control section 4 which
controls a device according to the speech recognition result, the reply signal generating section
5 for an audio response, and the used-language setting section 6.
[0016] 

The speech recognition section 3 can recognize two or more languages from the input sound
signal, and the reply signal generating section 5 can give an audio response in two or more
languages. At least one of said speech recognition section 3 and said reply signal generating
section 5 is constituted so that the recognition object language or the audio response language
is chosen according to the selection signal output from the used-language setting section 6.
[0017]
[Function] 

The speech recognition section 3 of this design has a speech recognition function for each
language, and is constituted so that the speech recognition function of the language indicated by
the judgment signal from the used-language judging section 2 operates. Therefore, two or more
languages can be recognized from the input sound signal.
[0018]

Moreover, the reply signal generating section 5 has an audio response function corresponding to
operating instructions for each language. When the used-language judging section 2 automatically
judges the language from the input sound signal, the judgment signal is input into the speech
recognition section 3 and the reply signal generating section 5, and the language used is
specified. In the speech recognition section 3, only the specified language is chosen and
recognized; in the reply signal generating section 5, the specified language is chosen and the
audio response is performed.
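
As a rough illustration of the signal flow described above, the following Python sketch feeds one judgment signal into both the speech recognition section and the reply signal generating section. The class names, the `select` method, and the language codes "ja" and "en" are assumptions made for illustration only; the source does not specify any implementation.

```python
class LanguageSelectable:
    """Base for a section that holds one function per language.

    Only the function of the currently selected language operates.
    """

    def __init__(self, languages):
        self.languages = set(languages)
        self.active = None  # no language is enabled in the initial state

    def select(self, language):
        if language not in self.languages:
            raise ValueError(f"unsupported language: {language}")
        self.active = language


class SpeechRecognitionSection(LanguageSelectable):
    """Stand-in for section 3 (recognition logic omitted)."""


class ReplySignalGeneratingSection(LanguageSelectable):
    """Stand-in for section 5 (speech output omitted)."""


def apply_judgment_signal(language, recognizer, replier):
    """Feed the same judgment signal into sections 3 and 5."""
    recognizer.select(language)
    replier.select(language)


recognizer = SpeechRecognitionSection(["ja", "en"])
replier = ReplySignalGeneratingSection(["ja", "en"])
apply_judgment_signal("ja", recognizer, replier)
print(recognizer.active, replier.active)
```

Because both sections receive the same signal, the recognition language and the response language cannot drift apart in this arrangement.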
[0019] 

Thus, simply by inputting voice from the voice input section 1, the language is judged and
speech recognition and audio response are performed in that language. Since speech recognition
and audio response can be carried out in two or more languages, the unit is convenient for export
equipment and when people of two or more nationalities use it. Moreover, it is not necessary to
make dedicated equipment for the language of each export destination, and since versatility is
high, it can be realized cheaply.
[0020] 

In addition, for the used-language judging section 2 there are two techniques: judging the kind
of language from the first sound signal input in the initial state and enabling only that
language thereafter, or judging the kind of language every time a sound signal is input,
inputting the judgment signal into the speech recognition section 3 and the reply signal
generating section 5, and specifying the language used. This design shall include both.
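
The two judging techniques can be contrasted in a short Python sketch. Everything here is hypothetical: the `UsedLanguageJudge` class, the `lock_on_first` flag, and the toy keyword classifier are illustrative assumptions, and the Japanese command words are represented by the translated words that appear in this document.

```python
class UsedLanguageJudge:
    """Stand-in for section 2 with two hypothetical judging policies.

    lock_on_first=True : judge the first utterance, then keep that language.
    lock_on_first=False: judge the language again on every utterance.
    """

    def __init__(self, classify, lock_on_first=True):
        self.classify = classify          # callable: utterance -> language code
        self.lock_on_first = lock_on_first
        self.locked = None

    def judge(self, utterance):
        if self.lock_on_first and self.locked is not None:
            return self.locked            # later input cannot change the language
        language = self.classify(utterance)
        if self.lock_on_first:
            self.locked = language
        return language


def toy_classifier(utterance):
    """Toy classifier (assumption): known Japanese command words map to 'ja'."""
    japanese_words = {"NIHONGO", "tape play", "tape stop", "tape ejection"}
    return "ja" if utterance in japanese_words else "en"


locking = UsedLanguageJudge(toy_classifier, lock_on_first=True)
per_utterance = UsedLanguageJudge(toy_classifier, lock_on_first=False)
for judge in (locking, per_utterance):
    print([judge.judge(u) for u in ("NIHONGO", "play the tape")])
```

With the first policy the second utterance is still treated as Japanese; with the second policy it is judged again and treated as English.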
[0021] 

In claim 2 as well, the speech recognition section 3 can recognize two or more languages from
the input sound signal, and the reply signal generating section 5 can give an audio response in
two or more languages. The setting signal output from the used-language setting section 6 is
input into the speech recognition section 3 to choose the recognition object language, or the
setting signal output from the used-language setting section 6 is input into the reply signal
generating section 5 to choose the audio response language.
[0022] 

Therefore, since two or more languages can be handled, the same effect as in the case of claim 1
is obtained. Although an operation to choose the language used manually in advance is required,
once the language used is set up, no further operation is needed unless the language used is
changed. This is advantageous for equipment in which the language used is changed infrequently,
and since the configuration is simple, it can be realized more cheaply.
[0023] 

[Example] Next, an example explains how the voice recognition unit by this design is embodied
in practice. Drawing 2 is a block diagram showing an example in which the voice recognition unit
of claim 1 is applied to an automobile. In front of the front seat 8 of the automobile 7,
microphones 1a and 1b are arranged on the driver's side and the passenger's side as voice input
means.
[0024] 

The sound signal detected by microphones 1a and 1b is input into the used-language judging
section 2 through the pretreatment circuit 9, which performs noise rejection and the like, and
the kind of language spoken by the speaker in the front seat 8 is distinguished. When the
used-language judging section 2 distinguishes the speaker's language, the distinction signal is
input into the speech recognition section 3 and the reply signal generating section 5, and only
the judged language becomes effective.
[0025] 

Audio equipment A1, an air conditioner A2, a land mobile radiotelephone A3, navigation equipment
A4, auto-cruise An, and so on are installed in this automobile as equipment.
[0026]

This example judges the kind of language from the first sound signal input in the initial state,
and is constituted so that only that language is enabled thereafter. Suppose that the speaker in
the front seat 8 wishes to operate devices in Japanese. If the speaker starts the engine and
first utters "NIHONGO" in the initial state, the sound signal input from microphone 1a or 1b is
input into the used-language judging section 2 through the pretreatment circuit 9, and the kind
of language used is judged to be "Japanese."
[0027] 

By this judgment signal, only Japanese is enabled in the speech recognition section 3 and the
reply signal generating section 5, and the input words or sentences are compared and collated
with the Japanese words or sentences registered in the speech recognition section 3.
[0028] 

For example, to play a cassette tape on audio equipment A1, if the speaker utters "tape play",
the sound signal input from microphone 1a or 1b is input into the speech recognition section 3
through the pretreatment circuit 9 and the used-language judging section 2, and is compared and
collated with the Japanese dictionary registered in the speech recognition section 3.
[0029] 

Although dictionaries of other languages, such as English, are also registered, the tables in
which other languages are registered are not chosen. Only the table in which Japanese is
registered is chosen by the judgment signal input from the used-language judging section 2, and
the input is compared and collated with the vocabulary registered beforehand.
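
The table selection can be pictured with a minimal Python sketch. The table contents, the command codes, and the `collate` function are assumptions for illustration: the Japanese entries reuse the translated command words from this example, and the English entries are invented.

```python
# Hypothetical vocabulary tables, one per language.
VOCABULARY_TABLES = {
    "ja": {"tape play": "PLAY", "tape stop": "STOP", "tape ejection": "EJECT"},
    "en": {"play the tape": "PLAY", "stop the tape": "STOP", "eject the tape": "EJECT"},
}


def collate(utterance, judged_language, tables=VOCABULARY_TABLES):
    """Compare the input only with the table of the judged language.

    Returns the command code, or None when no registered entry matches.
    """
    table = tables[judged_language]  # tables of other languages are never consulted
    return table.get(utterance.strip().lower())


print(collate("Tape Play", "ja"))  # matches the Japanese table
print(collate("tape play", "en"))  # no match: the English table has no such entry
```

Restricting the lookup to one table is what keeps a word registered only for another language from being accepted by mistake.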
[0030] 

As a result, the utterance is recognized as a playback instruction for the cassette tape, and
according to the recognition signal the control section 4 checks whether the cassette is inserted
and whether playback is possible. If playback is possible, a response saying "a tape is played"
is given in Japanese from a loudspeaker by the sound signal output from the reply signal
generating section 5, and then playback of the tape begins. If the cassette is not inserted, an
audio response is given saying "The cassette is not inserted. Please insert a cassette."

On the other hand, if "tape stop" is uttered in the playback state, then as in the case of the
playback instruction above, the control section 4 checks whether stopping is possible, a reply
signal such as "suspending a tape" is output from the reply signal generating section 5 and
sounded from the loudspeaker, and then playback stops.
[0032] 

Furthermore, if "tape ejection" is uttered to take out the cassette tape, the cassette tape is
ejected after the answer "ejecting a tape" is generated, by the same principle as in the
case above.
[0033] 

If the speaker in the front seat 8 wishes to perform hands-free operation in English, and utters
"English" after starting the engine, the sound signal input from microphone 1a or 1b is input
into the used-language judging section 2 through the pretreatment circuit 9, and the kind of
language used is judged to be "English." By this judgment signal, only English is enabled in the
speech recognition section 3 and the reply signal generating section 5, and the input English
sound signal is compared and collated with the English dictionary registered in the speech
recognition section 3. Therefore, the installed devices can thereafter be operated hands-free
in English.
[0034] 

Drawing 3 is a flow chart which shows the procedure in the equipment of the example of drawing
2.

Now, in the initial state after the engine is started, if "NIHONGO" is uttered in step S1 and
voice input is carried out toward microphone 1a or 1b of drawing 2, a sound signal is output,
and noise rejection and the like are performed by the pretreatment circuit 9 of drawing 2 in
step S2.
[0035]

**** vocabulary. Japanese is set up as the language used **** the reply signal generating
section 5 at step S6 ****. That is, since it is the vocabulary of an operator command at step
S6 **** "NIHONGO" or "English" **** chosen by the speech recognition section 3, and the
semantics of the inputted Japanese is recognized.

[0037]

**** the semantic judgment in step S7 **** step S8 as a result of the **** is urged.

In addition, if "English" is uttered in step S1 in the initial state after the engine is
started, the language is judged to be English **** the speech recognition section 3 and the
reply signal generating section 5 **** voice input, speech recognition and an audio response
****. Unless **** is suspended, modification of the language used cannot be performed ****.

**** it will be necessary to suspend the engine **** change the language ****.

[0042]

However, if vocabulary which is common to Japanese and English is used, like **** instruction
words such as "tape play" and "tape ****", cases will occur in which the audio response is
performed in English although the operating instructions were carried out in Japanese, or
conversely the response is performed in Japanese although the instructions were carried out in
English, and confusion will ****.

**** the used-language judging section 2 needs to carry out the language judging, as with "play
a tape" or "**** a tape", using the difference in function words, not only the description of a
**** Japanese, English **** without ordering with two words, in order to prevent this.

Whereas the above is an example in which the language is judged automatically from the input
voice based on the idea of claim 1, when the language used is set up manually as in claim 2,
then as shown in drawing 4, if the language used is chosen with a switch or the like in step S0,
a setting signal is generated from the used-language setting section 6 of drawing 1 (2) and
input into the speech recognition section 3 and the reply signal generating section 5, and the
language used thereafter is chosen.
[0044] 

Therefore, supposing English is set up in step S0, only operator commands in English become
effective thereafter, and to use another language it is necessary to set up again in step S0.
In this example, although the language setting switch must be operated manually, once it is set
there is no need to operate it at all as long as the language used is not changed. The language
may also be set up not only by a manually operated switch but by voice, as mentioned above.
[0045] 

In addition, only one of the recognition object language of the speech recognition section 3 and
the audio response language of the reply signal generating section 5 may be made selectable (the
other being performed in a predetermined language), or both languages may be made settable
independently of each other.
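
A possible way to hold such independent settings is sketched below in Python. The `LanguageSettings` class, its method names, the language codes, and the choice of "ja" as the predetermined default are all assumptions for illustration, not details from the source.

```python
from dataclasses import dataclass

SUPPORTED = ("ja", "en")  # hypothetical language codes


@dataclass
class LanguageSettings:
    """Stand-in for the state driven by the setting section 6."""
    recognition: str = "ja"  # recognition object language (section 3)
    response: str = "ja"     # audio response language (section 5)

    @staticmethod
    def _check(language):
        if language not in SUPPORTED:
            raise ValueError(f"unsupported language: {language}")

    def set_recognition(self, language):
        self._check(language)
        self.recognition = language

    def set_response(self, language):
        self._check(language)
        self.response = language

    def set_both(self, language):
        self.set_recognition(language)
        self.set_response(language)


settings = LanguageSettings()
settings.set_recognition("en")  # commands in English, replies stay in the default
print(settings)
```

Calling only `set_recognition` corresponds to the case where one side is selectable and the other stays in the predetermined language; `set_both` corresponds to switching the two together.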
[0046] 

Although the above example makes the two languages of Japanese and English usable, the design is
of course not limited to this, and three or more languages may also be made selectable and
usable.
[0047] 

As targets of voice operation, in addition to the devices in drawing 2, operation of various
kinds of switches, for example a wiper, can be performed with the voice recognition unit by this
design as far as possible. And it is applicable not only to operation of devices in an
automobile but also to operation of other devices.
[0048] 

[Effect of the Device] 

According to claim 1, simply by carrying out voice input, the language is judged and speech
recognition and audio response are performed in that language. Since speech recognition and
audio response can be carried out in two or more languages, the unit is convenient for export
equipment and when people of two or more nationalities use it. Moreover, since it is not
necessary to make dedicated equipment for the language of each export destination, versatility
is high and the unit can be realized cheaply.
[0049] 

In a configuration in which the language used is chosen manually in advance by switch operation
or the like, as in claim 2, once the language used is set up no operation is required unless the
language used is changed. This is advantageous for equipment in which the language used is
changed infrequently, and since the configuration is simple, it can be realized cheaply.
DESCRIPTION OF DRAWINGS



[Brief Description of the Drawings] 

[Drawing 1] It is a block diagram explaining the basic principle of the voice recognition unit by
this design.

[Drawing 2] It is the block diagram showing the example in which the voice recognition unit of
claim 1 is applied to an automobile.

[Drawing 3] It is the flow chart which illustrates the procedure in the example equipment of
drawing 2.

[Drawing 4] It is the flow chart which illustrates the procedure in the design of claim 2.
[Drawing 5] It is the flow chart which shows the response operation in the conventional voice
recognition unit.
[Description of Notations] 

1 Voice Input Section 

2 Used-Language Judging Section

3 Speech Recognition Section 

4 Control Section 

5 Reply Signal Generating Section 

6 Used-Language Setting Section
A1-An Various equipment in an automobile 



http://www4jpdl.ncipi.gojp/cgi-bin/tran.web.cgLeije 



2006/07/28 




DRAWINGS 



[Drawing 1]
[Drawing 2]
[Drawing 4]
[Drawing 5]